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Abstract 

Scaling laws are powerful summaries of the variations of urban attributes with city size. However, the validity 
of their universal meaning for cities is hampered by the observation that different scaling regimes can be encountered 
for the same territory, time and attribute, depending on the criteria used to delineate cities. The aim of this paper is 
to present new insights concerning this variation, coupled with a sensitivity analysis of urban scaling in France, 
for several socio-economic and infrastructural attributes from data collected exhaustively at the local level. The 
sensitivity analysis considers different aggregations of local units for which data are given by the Population 
Census. We produce a large variety of definitions of cities (approximatively 5000) by aggregating local Census units 
corresponding to the systematic combination of three definitional criteria: density, commuting flows and population 
cutoffs. We then measure the magnitude of scaling estimations and their sensitivity to city definitions for several 
urban indicators, showing for example that simple population cutoffs impact dramatically on the results obtained for 
a given system and attribute. Variations are interpreted with respect to the meaning of the attributes (socio-economic 
descriptors as well as infrastructure) and the urban definitions used (understood as the combination of the three 
criteria). Because of the Modifiable Areal Unit Problem (MAUP) and of the heterogeneous morphologies and social 
landscapes in the cities' internal space, scaling estimations are subject to large variations, distorting many of the 
conclusions on which generative models are based. We conclude that examining scaling variations might be an 
opportunity to understand better the inner composition of cities with regard to their size, i.e. to link the scales of the 
city-system with the system of cities. 


1. Introduction 

U Rban scaling laws are powerful summaries of 
the variations of urban attributes with city size. 
Indeed, when considering the variation of an 
absolute urban quantity Y against total population P in 
a city i, there is almost always a covariation between 
the two ||2T1, frequently in the mathematical form of a 
power law 

Y; = a * Pf 

where a represents a time dependent normalisation 
constant, and f the scaling exponent under enquiry. 

Superlinear relationships (i.e. : f > 1) indicate pos¬ 
itive returns to scale, or a relative concentration in the 
largest cities; whereas sublinearity (f < 1) is associated 
with economies of scale, or a relative concentration in 
the smallest cities. Linear scaling (f ^ 1) means that the 
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quantity per capita is constant across city size. Scaling 
exponents f estimated from empirical data have been 
interpreted as static or evolutionary properties, respec¬ 
tively by Bettencourt et al ||5l[6l and Pumain et al [2^. 
However, they are subject to variations with respect to 
urban delineation |2l, which questions the validity of a 
universal interpretation. 

For example, despite the existence of theoretical mod¬ 
els to predict the value of urban scaling from local in¬ 
teractions BllllllSl, an easy way to argue against the 
universality of scaling exponent values is to look at their 
variation with city definition. For instance, in France, 
there are two definitions of cities defined by the statisti¬ 
cal office INSEE (cf. table|^and figurej^in supplement): 

• Urban Units or Unites Urbaines (UU), which cor¬ 
respond to the aggregation of local units (communes) 
sharing a continuous built-up area of less than 200m 
between buildings, and 
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• Metropolitan areas or Aires Urbaines (AU), defined 
as the aggregation of a central Urban Unit and all the 
communes with more than 40% of active commuters to 
the centre. 

Comparing scaling results from those two official 
definitions, we find not only marginal discrepancies 
between expected values and estimated exponents, but 
evidence of different scaling regimes when we compare 
morphological and functional city delineations (cf. table 
[^, with similar goodness of fits (i.e. quite low for manu¬ 
facturing jobs and relatively high for the other attributes). 
In one case, say employment in the manufacturing sec¬ 
tor, the number of jobs grows more than proportionally 
with the population of density-defined Urban Units, 
whereas the number of such jobs per capita decreases 
with the size of functionally-defined Metropolitan Areas. 

A similarly inconsistent (or paradoxical) result was 
obtained for CO 2 emissions in US cities. In |[T5l p.768], 
Louf and Barthelemy asked: "Faced with these two oppo¬ 
site results, what should one conclude? Our point is that, 
in the absence of a convincing model that accounts for these 
differences and how they arise, nothing. [...] Conclusions can¬ 
not safely be drawn from data analysis alone". The paradox 
obtained from the comparison of city definitions can 
indeed question the very motivation for using urban 
scaling and its empirical analysis. However, even though 
there seems to be no point in trying to fit absolute scal¬ 
ing parameters, the variations in scaling estimation are 
of theoretical interest because of what they say about the 
relation between intra-urban spaces (micro-scale), city 
definitions (meso-scale) and urban scaling (macro-scale). 

Indeed, we consider the variations in scaling esti¬ 
mated between dense cities definitions and metropolitan 
areas not as a failure of a robustness test, but more as 
the expression of the different nature of urban spaces 
implied by the two definitions: the former describes the 
population within a dense environment of social interac¬ 
tions and infrastructure elements; the latter refers to a 
much larger functional space of economic interactions. 
Both can be called cities but they are not equivalent. For 
example, if one was interested in modelling the devel¬ 
opment of road infrastructures or industry locations, 
one would consider different strategies in the central 
and suburban parts of the city, because of differentiated 
opportunities to locate certain types of buildings (in¬ 
dustry for example), because of housing rent gradients 
or because of the urban atmosphere. Therefore, where 
the boundary is set to observe cities with respect to 


scaling is of crucial importance. The boundary concept 
also applies to the minimum population required to 
call a population aggregate urban and there might be 
differences of nature (and quality) between small towns 
and large metropolises with respect to certain indicators. 

An additional motivation to explore the multiple city 
definitions comes from the fact that official definitions 
rely on the choice of unique thresholds (e.g. distance 
between buildings, the percentage of commuters or a 
minimum population). Those have proven useful to 
describe urbanisation over time, but their precise value 
contains a share of arbitrariness that we want to evaluate 
in order to strengthen or question conclusions based on 
these definitions. Finally, varying definitional criteria 
will eventually produce a picture of scaling estimates be¬ 
tween the two official definitions for France and this will 
help us understand better the discrepancies observed 
empirically, as well as to compare studies performed on 
a large number of cities with analyses which analyse the 
upper part of the urban hierarchy only. 

This paper is devoted to the analysis of why and 
when we observe a transition from one scaling regime to 
another through generating a whole range of city defini¬ 
tions; in other words, by rearranging in multiple ways 
the local Census units that compose the different defi¬ 
nitions of cities (section]^. We analyse the variation of 
urban scaling with regard to the criteria used to define 
such cities, and argue that variations are not random 
(section]^. Instead, they can inform our knowledge of 
cities and the different parts they are composed of. We 
suggest a way to describe how discrepancies appear as 
the observed cities vary in their spatial and population 
extent. We finally propose potential explanations to help 
understand better the inner composition and morphol¬ 
ogy of cities at the two geographical levels (section]^. 
Section [Iv| concludes by focusing our interest on using 
urban scaling along with complementary explanations 
(regional, path-dependent, etc.) of the genesis of city 
systems. 

IL A Multiple Representation of the 
French system of cities 

"The extent of the city is important in a number of respects, 
not least in relation to the question of city size, an issue of 
considerable significance in urban and regional analysis" 
p.381]. Since our analysis relates to the variation of at¬ 
tributes with city size, knowing the sensitivity of this 
variation with respect to the spatial criteria for urban 
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Table 1: Scaling exponents for two city definitions in France. 


Urban attribute 

City Definition 


Cl (95%) 

J?2 

N 

Manufacturing 

UU 

1.175 

[1.13;1.22] 

0.543 

2226 

AU 

0.849 

[0.81;0.89] 

0.691 

771 

Vacant Dwellings 

UU 

1.051 

[1.03;1.07] 

0.797 

2233 

AU 

0.902 

[0.88;0.92] 

0.928 

771 

Basic Services 

UU 

1.086 

[1.07;1.10] 

0.892 

2233 

AU 

0.956 

[0.94;0.97] 

0.965 

771 

Education 

UU 

1.215 

[1.19;1.24] 

0.778 

2230 

AU 

0.981 

[0.96;1.00] 

0.922 

771 


Source of the data: French Census, 2011. UU : density-based Urban Units. AU : functionally defined Urban Areas. N: 
Number of cities in the regression. 


delineation is an important aspect. Beyond sensitivity, 
the different parts of the city (centre and periphery for 
example) are not expected to behave similarly vis-a-vis 
economic and infrastructural patterns. Indeed, they 
are composed of very different populations, built envi¬ 
ronments and lifestyles, hence representing an internal 
diversity of physical and social landscapes within the 
city Gol. The inclusion of some or all of these urban 
components might strongly affect our estimation of scal¬ 
ing laws. This argument is usually left out of the scope 
of predictive scaling theories where cities are consid¬ 
ered as homogeneous objects (for example in 0). We 
present three criteria used to delineate cities in France 
(section [^, and the resulting urban clusters generated 
from combining them in a systematic way (section]^, be¬ 
fore comparing results with classical definitions (section 

m- 

I. Systematic criteria for defining cities 

Our "continuous" delineation of cities adapted from 
Arcaute et al m follows a similar strategy to the one 
used in the official identification of cities in France by 
the statistical office (INSEE), that is : identifying a centre 
based on a density criteria to aggregate local units, de¬ 
lineating the periphery functionally associated with that 
centre, based on travel-to-work patterns of each local 
unit, and eventually applying a minimum population 
cutoff. The dense centres are called Urban Units (UU), 
and group communes sharing the same built-up area 
with a maximum distance of 200m between buildings. 
The metropolitan areas (AU), correspond to the aggre¬ 
gation of an UU concentrating more than 1500 jobs and 
with the communes sending more than 40% of their 
commuters to the UU. In order to perform a sensitivity 
analysis of urban scaling, we consider a variety of values 


for each criterion: 

• Urban density is associated with historical and 
morphological centres (or "core cities”), and charac¬ 
terises the part of a city in which the concentration of 
interactions and economic activity is maximised 1191 . 
Densities can be expressed in relation to the number 
of buildings or persons per unit of surface. We choose 
the latter option here, and let the minimum number of 
residents per hectare define dense cities varying from 
1 (loose centres) to 20 (very dense city cores). Popu¬ 
lation densities of official UUs are distributed within 
this range, with the median half of cities in the interval 
[0.9; 3.2] with the mean equal to 5. 

• Over the last two or three decades though, density 
alone has failed to reflect the spatial extent of urban 
labour and housing markets. Therefore, it is common to 
take into account functional indicators of urban activities 
taking place beyond dense city cores 1191191. To define 
such urban aggregates, researchers usually consider 
commuting patterns to the centres. The criterion for 
aggregation can refer to the share of income earned in 
the city core or the share of active residents commuting 
to the city centre. We choose the latter, and explore the 
variation of urban extent when this proportion varies 
from 0 (where a single commuter is sufficient to attach 
a commune to the urban centre) to 100% (where the 
functional city basically corresponds to the dense city). 

• Einally, the population minimum plays a major 
role in the definition of cities, and affects many urban 
measures (most notably Zipf's exponent, cf. ITtIIsIItIL 
This parameter reveals the conceptual trade-off between 
acknowledging that cities appear above a critical mass 
of population and using the larger scope of city sizes 


3 













Paradoxical Interpretations of Urban Scaling Laws • July 2015 • ArXiv 


in the study of systems of cities (when we use scaling 
measures). We present results obtained with population 
cutoffs from 0 (all clusters are considered) to 50,000 
inhabitants. 


II. Resulting "urban" clusters 

The clustering algorithms, developed and described 
in (23, first group contiguous local units that together 
exceed the population density criterion C. A commune 
is then linked to the core that attracts its largest per¬ 
centage of commuters if this percentage is above the 
flow cutoff F. By combining multiple values of C, F 
and a population minimum P, this process leads us to 
consider thousands of representations of the system of 
cities in France. 

In the following sensitivity analysis, we consider 
4914 such representations based on the combinations 
of 39 density values (D from 1 to 20 residents per ha 
in steps equal to 0.5), 21 commuting cutoffs (P from 0 
to 100% in steps equal to 5) and 6 population cutoffs 
(P from 0 to 50,000 inhabitants in steps equal to 10,000). 
Thirty different combinations are shown in figure 


III. Matching with official definitions 

In order to evaluate the quality of our clusters and 
their ability to describe the transition between official 
definitions, we measured the correspondence between 
each generated cluster definition and three classical 
urban delineations: Urban Units in 2010, Metropolitan 
Areas in 2010, and Urbanised Land Use Areas given by 
CORINE LandCover 2006 raster data (cf. table and 
figure [ST] from the supplement). This measure consists 
of the correlation between urbanised local units in the 
official and the cluster definitions (a correlation of 1 
meaning that both definitions match perfectly: each "ur¬ 
ban” local unit by official standards belongs to one of the 
systematic clusters for a given definitional combination). 
By doing this, we check if the method is able to generate 
urban clusters that correspond to the different official 
definitions, in order to study the transition between 
them in an almost continuous way. Given the methods 
used by the statistical office, we expect definitions to 
differ on the commuting criteria, which should be close 
to 100% for the Urban Unit equivalent, and close to 
40% for the Urban Area equivalent, with similar density 
cutoffs. 


Indeed, we find a good match between Urban Units 
and clusters defined as having a density over 1.5 resi¬ 
dents per ha, almost no commuting flows (P = 100%) 
and no population cutoff (table and figure top 
left). The correlation coefficient between belonging to 
such a cluster and belonging to an Urban Unit for each 
commune is 0.66, with an of 44%. For Urban Areas, 
we find the following expected values for definitional 
criteria: a density cutoff of 2.0 (close to that of Urban 
Units), a commuting cutoff of 35% (close to the official 
40%) and a larger population cutoff (10,000 persons). 
However, the match is of weaker quality (R^ = 0.134). 

The previous definitions correspond to definitions 
based on administrative boundaries. An alternative 
but common urban definition comes from the use of 
CORINE LandCover raster data. When computing the 
correlation coefficient between the belonging to a clus¬ 
ter and the % of land classified urban in the CORINE 
images for each commune, we find a match of better 
quality (R^ = 0.58) with clusters defined with D = 4.5 
persons per ha, P = 100% and no population cutoff. 


III. Understanding the variations of 

SCALING BEHAVIOURS 

Building on an almost continuous representation of sys¬ 
tems of cities, we are able to estimate scaling laws and 
their sensitivity to the variation of the urban definition 
criteria. Evaluating the sensitivity of scaling provides an 
opportunity to identify where scaling regime transitions 
might take place and to characterise the types of cities to 
which it corresponds. This will help build models that 
could account for the emergence of such scaling (and 
varying) behaviours, based on a non-uniform internal 
morphology of the cities considered. 

The variables used to describe urban attributes in 
the following analysis have been collected at the local 
level (c^ 36000 local units in continental France). Socio¬ 
economic and travel-to-work data come from the last 
Population Census and surveys in 2011, whereas land- 
use data are extracted from CORINE LandCover 2006 
raster data, road length derive from an Open Street Map 
dataset computed by C. Quest in 2014, and housing 
permits come from governmental open data. A more 
detailed description is available in the supplementary 
materials (section 
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Figure 1: A wide range of representation of systems of cities in France (P = 0) 





D is the minimum density of residents per ha which defines urban centres (in red). C indicates the share of 
Commuters (in %) living in the periphery and working in the density-based urban clusters (in green). P, the 
population minimum, is here set to 0. Aggregation is performed using the 2013 GeoFla geometry of communes, 
https://www.data.gouv.fr/fr/datasets/geofla-communes/ 


5 




Paradoxical Interpretations of Urban Scaling Laws • July 2015 • ArXiv 


Table 2: Three depictions of city definitions (top) and their corresponding urban clusters (bottom). The represents the level of correspondence 
between each pair and n the number of cities considered. 


City definitions 

Unites Urbaines 

Aires Urbaines 

CORINE LandCover 



A ^ 

Corresponding Urban Clusters 

D = 1.5,F = 100,P = 0 

D = 2,f = 35,P = 10,000 

D = 4.5,f = 100,P = 0 

mm 

* *' 

*T ■- 

= 0.436 1 n = 1173 

= 0.134 1 n = 309 

R^ = 0.580 1 n = 519 


Source. INSEE: www. insee. f r, CORINE LandCover 2006 raster data (Urban categories of land use No. Ill and 112): 
http://www.statistiques.developpement-durable.gouv.fr/donnees-ligne/li/1825.html 


1. Variations of scaling behaviours 

Eigure represents the distribution of the estimated 
j6 for the 4914 representations of the Erench ur¬ 
ban system. We used a kernel density estimation 
(http://www.inside-r.org/r-doc/stats/density) to 
ease visual comparisons across attributes and cluster 
definitions. We prefer this representation to histograms 
in this case because it allows us not to fix any bin num¬ 
ber or size, which would have taken different values for 
each distribution. The kernel procedure, on the contrary, 
makes the representation of the distributions continuous 
and comparable. This analysis of density estimations 
can be summarised by two findings. 

Eirst, we find substantial variations in the scaling 
exponents measured for the different representations 
of the system of cities. The maximum intervals of esti¬ 


mated exponents often range from sub-linear (j6 < 1) to 
superlinear regimes (j6 > 1). Eor example, the area of 
urban clusters (figure]^ top left) scales from sublinearity 
(j6 = 0.33) to superlinearity = 1.29) with population. 
The length of paths and roads is the second most volatile 
variable with respect to urban scaling, as the exponents 
estimated range from j6 = 0.66 to j6 = 1.20. These results 
are not so surprising as the two variables are physical 
and therefore highly dependent on the spatial definition 
of cities. However, many other variables (such as the 
number of jobs in the education or the manufacturing 
sectors, the number of hospitals, etc.) are also affected 
by the choice of urban criteria. To summarise, with 
the exception of the number of dwellings which clearly 
scales linearly with population (0.95 < jS < 1.03), all the 
urban attributes considered in the study range across 
two or more scaling regimes and cannot be classified in 
a definitive way by a single value for all possible urban 
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Figure 2: The distribution of scaling exponents for selected attributes over the entire set of city definitions 
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Kernel Density Estimation (KDE) of scaling exponents of 15 urban attributes from 4914 systems of clusters-cities 
resulting from the combination of three definition criteria. N.B. : the number of clusters in each regression varies, 
from 1298 to 48, depending on the restrictive character of the definition criteria 
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definitions. 

The second finding therefore relates to the differ¬ 
ent magnitudes of variations observed for the different 
variables under study. If only the number of dwellings is 
stable over the complete range of city definitions, some 
attributes appear stable in relation to population for a 
majority of definitional criteria combinations (in terms 
of the scaling regime rather than the specific value of 
the exponent). For instance, the number of households 
owning a car (j6 G [0.94,1.03]. N.B. this interval corre¬ 
sponds to the amplitude of estimation across definitions, 
not a confidence interval) and jobs in proximity ser¬ 
vices (j6 G [0.95,1.07]) scale linearly with population 
most of the time. Similarly, the number of hospitals, of 
persons employed as "workers” and the urbanised area 
scale sublinearly in a majority of representations of the 
system of cities (j6 belongs respectively to the intervals 
[0.63;1.02], [0.87;1.01] & [0.81;1.05]). The number of per¬ 
sons employed as managers and professionals, the jobs 
in finance or in research are symmetrically mostly su- 
perlinear urban attributes (j6 magnitude of respectively 
[1.02;1.27], [0.98;1.21] & [0.95;1.5]). Such behaviours are 
consistent with results obtained with ordinary repre¬ 
sentations of systems of cities ||20l|5|. However, given 
particular criteria for urban definition, the behaviour 
of these attributes with city size would change to the 
opposite scaling regime. Finally, bimodal distributions 
are clearly noticeable from figure on both sides of the 
linear value where j6 = 1 : e.g. the number of jobs in 
manufacture ([0.83;1.25]), in health and social services 
([0.9;1.17]) and in education ([0.92;1.14]). These features 
suggest that there might be two sets of urban defini¬ 
tional criteria generating two scaling regimes. 

This observation leads us to inquire much more into the 
determinants of variations in urban scaling. 

11. What makes urban scaling vary ? 

In this section, we look for systematic variations of ^ 
with the definitional criteria, and ways to interpret and 
explain them. We proceed in four steps : 

• Building a typology of cluster definitions and compare 
their distributions of j6 (section [IlT| , 

• Using heatmap representations to visualise and com¬ 
pare variations of urban scaling with definitional criteria 
for the different attributes, producing representations 
similar to a phase diagram (section [IL2| 

• Grouping the most similar heatmaps with hierarchical 
clustering (section [IL3| and 

• Comparing extreme estimations with extreme observa¬ 
tions from the literature (section |IL4| ). 


ILl Clusters Typology 

Our first approach is to differentiate urban clusters 
based on their spatial extent and population cutoff. 

We partition our set of urban realisations into two 
subsets: the first represents the ones which are similar to 
existing definitions of cities in France and we call them 
common clusters (cf. Supplementary material, figure 
[ST) . Alternative clusters represent definitions which 
deviate from the usual representations of the French 
system of cities, yet are within reasonable bounds for 
spatial criteria. Common clusters are defined as centres 
with population densities ranging from 1 to 5 residents 
per ha (57% of Urban Units belong to this interval) and 
peripheries made of local units from which 35 to 100% 
of the active population commutes to the dense centres 
(the proportion is 40% for the definition of Metropolitan 
Areas and virtually 100% for Urban Units). Alternative 
clusters represent all remaining definitions at a given 
population cutoff (cf. figures and [ST] ). Population 
cutoffs do not affect the spatial extension of cities, but 
drastically change the number of cities considered (the 
higher the cutoff, the lower their number). For example, 
with the same criteria of D = 1.5 & F = 100%, there 
are 1172 urban clusters in France, but less than a third 
of them have more than 10,000 residents (300), and 
only 98 meet the P > 50,000 definition (for D = 2 and 
F = 30, the actual numbers are respectively 991, 341 & 
138). Therefore, among common and alternative clusters, 
we differentiate between definitions with a population 
cutoff and definitions without. 

We find that this first categorisation eases the in¬ 
terpretation of the variations of the scaling exponents, 
especially in the case of bimodal distributions (figure]^. 
For example, the two modes in the distribution of the 
scaling exponents for jobs in the manufacturing sector 
corresponds to two types of definitions: the ones with a 
population cutoff and the ones without (independently 
from the spatial extent of cities, common or alternative). 
In other words, one finds that manufacturing jobs scale 
superlinearly with city population when all cities are 
considered, especially very small aggregates. On the 
contrary, when one sets a minimal size for cities to 
be considered, they scale sublinearly to linearly with 
population (fourth column, figure]^. This result tells us 
that manufacturing is neither a specialisation of small 
nor large cities: the bimodal distribution and the transi¬ 
tion in scaling regimes could instead indicate either the 
importance of medium size cities in the concentration of 
manufacturing jobs or the high threshold in population 


8 



KDE KDE KDE 


Paradoxical Interpretations of Urban Scaling Laws • July 2015 • ArXiv 


Table 3: The number of urban realizations within each typological group 


Definitions 

Population cutoff 

No cutoff 

Total 

Common clusters 


126 

756 

Alternative clusters 


693 

4158 

Total 

4095 i 

819 

4914 


Common clusters :l<D<5&35<f< 100. 
Population cutoffs : 10,000; 20,000; 30,000; 40,000; 50,000. 


Figure 3: The distribution of scaling exponents for selected attributes over 


the four typologies. Legend : cf table 
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for economies of scale to appear. The same pattern 
holds for the education and the health and social sectors, 
even though the picture is less clear-cut for alternative 
clusters. In the case of financial and managerial jobs, the 
application of a population cutoff does not change the 
scaling regime but clearly lowers the value of scaling 
exponent j6, from around 1.2 to around 1.1. This might 
begin to explain why mixed evidence appears in the lit¬ 
erature, depending only on the size of cities considered 
in the analysis. The opposite holds true for professional 
and managerial occupations. 

For some urban indicators (the length and surface 
variables for instance), the scaling behaviour does not 
respond monotonically to the application of a popula¬ 
tion minimum (first column, figure]^. For the majority 
of others, however, this definitional criterion appears to 
be the most important to understand variations in the 
scaling exponents for most urban indicators, more than 
matters in terms of the spatial extent of cities. 

IL2 Scaling heatmaps 

To provide a more detailed account of the sensitivity of 
attribute scaling with respect to the definitional criteria 
and their combination, we use multiple representations 
of 2D heatmaps. Figure for instance shows scaling 
exponents and values for several indicators with pop¬ 
ulation for 3276 definitions. In the case of the road 
length (figure]^ top left), the interplay of the three defini¬ 
tional criteria produces differentiated scaling behaviours. 
Also, not only do the scaling exponent values vary : the 
scaling regime (sub- or super-linear) depends on the 
combination of density, commuting flows and popula¬ 
tion cutoff. Indeed, although this variable is known to be 
sublinear from previous reports in the literature (e.g. in 
= 0.667, and in IIT6ll j6 = 0.86), we show that when 
relatively dense urban clusters (> 5 residents per ha) 
with significant peripheries (flow cutoff < 50%) and low 
population cutoffs (< 20,000) are defined, superlinear 
behaviours are encountered. This behaviour is hidden 
over a population cutoff of 50,000 inhabitants, but the 
quality of the regressions also lowers as the population 
cutoff increases. This means that, in contradiction with 
some empirical evidence and model predictions, the 
road length per capita is higher in larger urban aggre¬ 
gates (corresponding to the spatial extents at the bottom 
right corner of figure]^. The second heatmap (figure 
top right) shows the relatively weak importance of 
flow and population cutoffs on the estimation of urban 
scaling for hospitals up to 50,000 persons, and the strong 
variation due to density over this threshold (towards a 


linear pattern). Finally, figure bottom left and right 
reveal cases of stability of scaling behaviour associated 
with robust models : > 90% (again, stability here 

refers to the scaling regime, not the value of the exponent 
which may vary substantially). The number of dwellings 
scales linearly with population in any combination of ur¬ 
ban definition, whereas the number of professionals and 
managers scales superlinearly, especially when cities are 
defined using high density, high commuting integration 
and high population cutoffs. 

IL3 Hierarchical clustering 

In order to summarise the proximity between values 
and variations of scaling exponents of 20 different ur¬ 
ban attributes, we performed a hierarchical clustering 
of heatmaps for each population cutoff (figure]^. The j6 
estimated on the clusters defined by the combination of 
21 density cutoffs and 39 commuter cutoffs represent the 
variables (819 columns) describing each urban attribute 
for each population cutoff (the 126 individuals or rows). 
The clustering procedure therefore distinguishes groups 
of indicators whose scaling behaviour responds the same 
way to the city definitional variations. A typology of 9 
classes corresponds to clear cuts in the clustering tree 
(dendrogram) and cover 47.1% of the total variance. 

Class 1: monotonically linear attributes. The first 
class in this typology groups together attributes that 
scale proportionately with population (0.95 < ^ < 1.05) 
for all values of density and commuting criteria. In the 
case of the number of dwellings, of firms, of households 
owning a car and of jobs in the proximity service sector, 
the scaling exponent measured is linear irrespective of 
the minimum population cutoff (figure]^. At high popu¬ 
lation cutoffs (i.e. when only large cities are considered), 
jobs in the educational, health and social sectors also 
belong to this class. At low population cutoffs, the num¬ 
ber of "workers”, manufacturing jobs, vacant dwellings 
and universities also tend to be linear urban attributes. 
These results relate to the interpretations of |20l and ||6l 
concerning indicators of "mature" industries and "basic 
needs" proportional to urban populations. Following 
these researchers, linear attributes describe industries 
mature enough not to require increasing returns to scale 
to compensate for innovation costs, or goods and ser¬ 
vices proportionately distributed among city dwellers, 
since everyone needs a comparable amount in every city 
of the system (in terms of dwellings, for example). 
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Figure 4: Heatmaps of scaling exponents and goodness of fit across city definitions 
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Figure 5: Clustering ofheatmaps across city definitions 
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Class 2, 3 & 6: sub-linear attributes. In the types 2, 
3 & 6, we find linear to sublinear attributes such as 
infrastructure, morphological and physical indicators 
(number of train stations, road length, urbanised area, 
number of vacant dwellings and hospitals) as well as 
"obsolete” ||2Q|| industries (manufacturing) that scale 
overall in the way predicted in the formerly cited the¬ 
ories: they grow slower than proportionately with city 
populations, revealing economies of scale (for physical 
attributes) and specialisation of small cities (for obso¬ 
lete industries). However, these three classes do not 
appear monotonic in their response to changes in city 
definitions. For example, the attributes are distributed 
in various classes depending on the population cutoff 
applied. Train stations for example scale much more lin¬ 
early among cities of more than 20,000 inhabitants (class 
2) than when cities above 10,000 (class 3) or less (class 
6 ) are taken into account. Indeed, all towns usually 
contain a local train station in France, leading to a very 
low exponent associated with no population cutoff (or 
PO, class 6), especially when only urban city cores are 
considered without their functional periphery (i.e. high 
commuting cutoffs). Among large cities, potentially 
those with more than one train station, the behaviour 
becomes more linear. High sublinearity is therefore 
more of an artefact due to complete provision in small 
towns, while the number of stations is indeed almost 
proportional to the number of city residents. 


Class 4, 5 & 9: super-linear attributes. As observed in 
figure]^ research, finance and management jobs, along 
with professional occupations, collective housing and 
universities scale linearly to superlinearly, i.e. among 
classes 4, 5 & 9. We also find attributes that belong to the 
superlinear class 5 on the heatmaps at low population 
cutoffs (Education, health and social jobs, number of 
employees), while their scaling behaviour under higher 
cutoffs is monotonically linear (cf. class 1). For these 
indicators, the more restrictive the delineation of city 
centres (i.e. small periphery and large population), the 
more linearly they grow with population. Therefore, the 
superlinear scaling observed in class 5 for weak flows 
and low population cutoffs reveals a higher provision 
in public services and employment opportunities in 
urban spaces compared to rural ones, rather than a size 
differentiation among cities. 


Class 7 & 8: mixed attributes. Classes 7 & 8 represent 
non-monotonic regimes of urban scaling (of particu¬ 


lar interest to this paper). Class 7 groups heatmaps 
for road length, area and individual housing at low 
population cutoffs, and show a combination of three 
regimes: sublinear (for commuting flows F > 50%); lin¬ 
ear (for F < 50% & D < 10 persons per ha); superlinear 
(F < 50% & D > 10). When we refer to figure]^ this 
means that those indicators of suburban morphology 
grow less than proportionately with population in city 
centres (top half of the figure), proportionately in vast 
configuration of cities that represent most of the French 
territory (bottom left), and more than proportionately 
with population when cities are considered as high den¬ 
sity kernels surrounded by large peripheries (bottom 
right). This obvious conclusion highlights the fact that 
in the case of physical urban attributes, the estimation 
of scaling exponents is directly related to the choice of 
city definition. Class 8 comprises only area heatmaps 
for high population cutoffs and is to a lesser extent af¬ 
fected by the density criterion to define city centres. The 
resulting picture is somehow opposite to class 7 where 
area scales linearly to superlinearly in city centres (high 
f) and sublinearly in systems with vast peripheries (low 
F). In other words: large metropolitan areas get denser 
as they grow, while the surface occupied by city centres 
is more or less proportional to population (probably 
revealing areas occupied by offices and large infrastruc¬ 
tures compensating for higher residential density). 

An interesting feature of these mixed classes is the 
fact that the transition between the two regimes happens 
around the flow cutoff value of 40%, the one chosen by 
INSEE to define peripheries of metropolitan areas (AU). 
This cutoff corresponds mostly to linear behaviour in 
the classes, but also to a transition space between two 
radically different scaling regimes. The linear scaling at 
this value therefore hides high variability when cutoffs 
are slightly pertubed. It is thus not robust to city defi¬ 
nition, indicating that either 40% is "the true value for 
cities", or more probably that there is no interpretation 
of the scaling of these variables independently from the 
criteria used to define cities. 


Several types at different population cutoffs: manu¬ 
facturing jobs. A special case is that of manufacture. 
The scaling heatmaps for this indicator are found in 
linear, sub- and superlinear classes according to which 
population cutoff is considered. Scaling laws are sup¬ 
posed to account for regular variations across several 
orders of magnitude, and specifically model the distri¬ 
bution of attributes of cities of different population sizes. 
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These results for manufacture therefore suggest that the 
power law adjustment might not be the most interesting 
one to describe the evolution of manufacturing jobs with 
city population. It also suggests that other factors of 
explanation are necessary to understand the distribution 
of manufacturing jobs in French cities, such as resource 
and path-dependency, regional particularities and eco¬ 
nomic cycles (cf. section]^ and residual analysis from 
section ^nl|in supplement). 

Clustering summary. The clustering of heatmaps has 
thus provided a synthetic way to describe the dominant 
regime and variation of scaling measures of 20 indicators 
with respect to urban definitional criteria. The amplitude 
of variations appears marginal for many indicators for 
which the dominant scaling regime corresponds to the 
one predicted theoretically or from ordinary definitions 
of cities. We also identified groups of urban attributes 
for which the variation of the scaling exponent depends 
quantitatively on density and flow cutoffs. Physical 
attributes for instance are not independent from the 
spatial definitions of cities. There, scaling behaviour 
appears impossible to characterise independently from 
city definitional criteria. On the other hand, we found 
that social and public services seem to exhibit constant 
returns to scale over a minimum threshold of popula¬ 
tion. Finally, manufacturing jobs have been found to be 
loosely linked to population and are better described 
by other urban features (especially with respect to their 
history of early diffusion in north-eastern France). 

A last attempt at understanding and explaining varia¬ 
tions in scaling comes from the confrontation of extreme 
values recorded in the literature and by our methodol¬ 
ogy- 

IL4 Extreme Scaling 

This last section examines the extreme cases of scaling 
measured on systems of cities: in the literature and with 
a systematic definition. Table gives the maximum 
intervals of scaling exponents found in the literature 
and in this study among the 4914 combinations of defi¬ 
nitional criteria. 

The total area of cities is an interesting example 
of such extreme variations reported in the literature 
as well as in our own study. The lowest scaling expo¬ 
nents measured are very low (j6 = 0.3) and represent 
extremely sublinear behaviour compared to the mini¬ 
mum values we found in the literature (0.676 in ||3i|). In 


the case of contemporary urban France, we found this 
minimum value for a very restrictive definition of cities 
corresponding to narrow centres without peripheries 
(D = 13.5,F = 100, P = 0\N = 202). On the contrary, a 
superlinear regime (j6 = 1.2) comparable to that found 
by Veregin and Tobler for 366 US cities in 1980 corre¬ 
sponds to a relatively common definition of metropoli¬ 
tan areas (D = 7.5,P = 45,P = 10,000|N = 200). In 
this case, the dramatic variation of scaling estimation 
depends on the way cities are spatially defined, as they 
mechanistically affect the ratio of surface per inhabitant. 

For finance and lifestyle measures such as the ur¬ 
banised area and the number of households with a car, 
the maximum variations happen with respect to the 
way the periphery is taken into account. This seems 
consistent with how morphology and lifestyle interact: 
suburban spaces are dependent on the use of car, and 
therefore this variable is the closest to superlinearity in 
a definition of cities that comprises large peripheries 
(^ = 1.03|D = 20,P = 5,P = 10,000|N = 98) and the 
closest to sublinearity in narrow centres with almost no 
commuting (j6 = 0.94 |D = 17.5, P = 95, P = 50,000|N = 
50). The interesting insight provided here by looking 
at the scaling behaviour is that the low consumption of 
cars and housing per capita in city centres is reinforced 
as they grow in size. On the contrary, peripheries tend 
to exacerbate suburban lifestyles when they belong to 
larger cities. 

In the case of research and education, the distinction 
between min and max j6 is associated with the density 
dimension (the way city centres are defined). Those 
urban attributes are known to be strongly linked to the 
density of interactions ||5[|ll. What this analysis brings 
is an insight into extreme behaviours not reported so 
far in the literature (such as superlinear scaling for 
roads and sublinear scaling for research under extensive 
definitions probably comprising rural spaces). 

The number of jobs in proximity services appears 
different from the previous category as the minimum 
and maximum scaling values are opposed in the way 
the ratio of centre and periphery is defined. Indeed, this 
variable scales sublinearly when clusters correspond 
to large dense centres with a restricted periphery (j6 = 
0.951D = 5,P = 55,P = 20,000IN = 163) and superlin- 
early when city cores are very narrow and peripheries 
extensive (^ = 1.07|D = 16,P = 0,P = 10,000|N = 142). 
This could mean that, although basic services are subject 
to economies of scale in the largest cities, they are not 
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equally profitable in suburban spaces and are provided 
more systematically in the peripheries of large cities 
rather than in the periphery of smaller ones. 

Finally, health and social services jobs show extreme 
scaling behaviour under similar spatial definitions of 
cities, but at different population cutoffs. This urban at¬ 
tribute is the most sublinear at the top of the urban hier¬ 
archy = 0.901D = 4,f = 65,P = 50,0001N = 88) and 
most superlinear when the entire spectrum of city size is 
considered = 1.17|D = 1.5,F = 95,P = 0|N = 1094). 
This might suggest the existence of a critical size to 
provide such services and their subsequent economies 
of scale. 


IV. Conclusion : from quantitative to 

QUALITATIVE CHANGES IN THE URBAN 
HIERARCHY ? 

"Whether a particular class of prosocial behavior scales lin¬ 
early, superlinearly, or sublinearly might he dependent on 
two factors: the locality of one's interactions, and feedback 
of interaction. For locality, this refers to whether or not you 
are limited to the individuals around you, or can interact 
with anyone in the city. [...] The potential for understanding 
the spectrum of scaling of prosocial behaviours points to 
the wide variety of aspects of seemingly related behaviours. 
Their implications for urban growth are intriguing and merit 
further examination" HI p.2158] 

Scaling laws were first proposed as an interesting 
tool to summarise distributions of characteristics in a 
system of cities with respect to city size over several 
orders of magnitude. It lies on the conception that 
cities share common attributes across a wide size spec¬ 
trum. This proposition finds roots in the long quest 
to find what makes cities identifiable (specifically ur¬ 
ban features). Social scientists agree on the fact that 
we recognise cities by their function as social interac¬ 
tions maximisers. They do so by concentrating a larger 
number of heterogeneous social agents in a limited 
space (hence, dense). Those cities take part in a system 
of cities at a larger scale, characterised by a regular 
hierarchy of sizes, competition (in space resulting in 
regular spatial patterns of settlements) and cooperation 
(resulting in complementary profiles of economic and 
functional specialisation) |[20|. Therefore, the analysis of 
systems of cities should not be restricted to large cities 
only (a constraint generally imposed by data), as small 
cities are an essential part of urban systems. 


Regimes of sub- and superlinearity indicate quantita¬ 
tive changes which occur with size variations. Moreover, 
several results from this paper indicate the existence of 
high variability of scaling exponents with respect to vari¬ 
ations in city definition. The transition from one scaling 
regime to another when population cutoffs vary was 
found to be the most important criterion in most cases 
(and the easiest to harmonise in comparative studies). 
For example, if we go back to the paradoxes implied in 
table we find that scaling behaviours with the two 
official definitions start to match when a population 
cutoff is applied (cf. figure]^. Over 50,000 residents, 
city centres behave similarly to metropolitan areas with 
respect to socio-economic criteria. The paradox appears 
mostly due to the large dispersion encountered among 
small cities (cf. figure |^. These deviations from the 
power law adjustment are neither random errors nor 
systematic biases for particular cities, for they reveal 
factors unrelated to city size that play an important role 
in the explanation of the attribute location, for example: 
coastal accessibility for secondary houses, regional dif¬ 
fusion in manufacturing, etc. (cf. residual maps from 
figure in supplement). 

The main finding of this paper is that urban scaling is 
relative to the definition of cities, and most importantly 
that variations with respect to definitional criteria are 
neither random (since residuals can be interpreted on 
a case by case basis) nor universal for all the variables 
under study. Instead, some attributes are more sensitive 
to the spatial delineation of centres and peripheries 
(interaction-based activity, life-style attributes), while 
others respond to changes in population cutoffs (for in¬ 
stance: infrastructures with high fixed costs). Although 
there seems to be no single "good” definition to study 
urban scaling, what we found is that while selecting 
one for comparison in time or between national systems, 
this choice should depend on the attribute under consid¬ 
eration. In particular, the minimum population used to 
identify cities plays a major role in the scaling variation, 
and should be integrated fully in the interpretation of 
results. 

As a limitation to this work, we could point to the 
fact that the assumption that cities are monocentric (by 
attaching commuters to a single centre) could be refined 
to provide results more in accordance with the diversity 
of urban forms (and in particular polycentricity, cf. HZI). 
Also, according to Guilluy ITOl . the "fracture” between 
central metropolises and peripheral suburbs and rural 
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Table 4: Maximum scaling registered in the literature compared to estimated values with a systematic definition of French cities 


Urban attribute 

Max. interval for clusters 

in literature [ref] 

^rnax irt literature [ref] 

^uu 

^AU 

Total Area 

[0.334; 1.291] 

0.676 im 

1.163 |22| 

0.959 

0.995 

Road Length 

[0.664; 1.202] 

0.667 (T3| 

0.86 na 

0.903 

0.888 

Research 

[0.951; 1.496] 

1.174 ||5l 

1.67 11201 

1.094 

1.079 

Health and Social 

[0.898; 1.171] 

0.95 ||20l 

0.98 iini 

1.136 

1.013 


For more values from the literature and the corresponding definition of cities (territory, date, delineation), cf. 
Supplement 


Figure 6: Variation of urban scaling with population cutoff for official definitions 
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spaces in France is recent and increasing. It favours 
the concentration of extremes categories of workers 
and social classes in globalised metropolises and rejects 
low and middle social classes to a peripheral France 
of suburbs and small cities. Provided the collection of 
temporal data, our method should allow to test these 
assumptions quantitatively by showing larger variations 
of urban scaling with city definitions over time for eco¬ 
nomic and sociologic categories. 

With this insight and a deeper search for processes 
linking intra-urban features to interurban scaling, a logi¬ 
cal continuation of this work would be to build models 
able to simulate empirical scaling patterns. These mod¬ 
els (i.e. models that seek to explain the role played by 
city size in the distribution of functions and infrastruc¬ 
tures) should now account for the contrasted behaviours 
of the different attributes, especially at the intra-urban 
scale. That is, if absolute values of urban scaling are 
meaningless, there is no point in trying to validate 
models against them. Instead, the validation goal of 
generative models should be to reproduce variations in 
scaling with respect to the variations of the definitional 
criteria. 
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Supplementary Materials 

I. Correlation between systematic clusters and official definitions 


Figure SI: Correspondence between clusters definitions and official cities 
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11. Description and sources of urban attributes used in the paper 


Table Sl: Sources of data used in the paper 


Type of data 

Name 

Description 

Source 

Geography 

GeoFla 2013 

CORINE LandCover 
MOBPRO 2011 

Shape Files 

Land use 
Commuting flows 

w w w. da ta. gou v. f r 

WWW. statistiques. developpement-dur able. gouv. f r 
www.insee.fr 


base-cc-emploi-pop-active-2011 

households etc. 

www.insee.fr 


CLAP 2011 

Jobs by sector 

www.insee.fr 


equip-tour-transp 2013 

Infrastructures 

www.insee.fr 

Attributes 

equip-serv-sante 2013 

Hospitals 

www.insee.fr 


equip-serv-ens-sup-form-serv 2013 

Universities 

www.insee.fr 


OpenStreetMap 2014 
Sit@del2 2011 

Length of roads'^ 
Housing permits 

w w w. da ta. gouv. f r 
w w w. da ta. gouv. f r 


"^primary, secondary, tertiary, motorway and trunk categories added together 


Note: The term "worker” here stands for the French category ouvrier, that is a social occupation defined 
by the census, which corresponds partially to a plant or manual worker. Proximity services relate to jobs in 
everyday services except distribution, transportation, education and health. For example, they include jobs such 
as hairdressers or laundry services. Source: http ://www. insee.fr/fr/themes/det ail. asp?reg_id=99&ref_id= 
analyse Housing permits are counted as the ones authorised in 2011. Source: https://www.data.gouv.fr/fr/ 
datasets/permis-de-construire-pc-permis-d-amenager-pa-et-declaration-prealable-dp-sit-del2/ 

Paris, Lyon and Marseille are here considered as single communes (i.e. not disaggregated into smaller units known as 
arrondissements). 
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III. Residuals from scaling regression 


Figure S2: Residuals of regression with population (in logs) 



From left to right: Residuals in the Number of Unoccupied Secondary Dwellings, Number of Jobs in Manufacturing, 
Number of Dwellings. Residuals are obtained from a regression in logs. Red values correspond to a value for the 
attribute higher in reality than expected with respect the size of the city, blue means that scaling over-estimates the 
value for the attribute based on the size only. Bright colours indicate large deviations from the scaling estimation. 
Cluster Definition: D = 4, f = 40, P = 0. 
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IV. Scaling and allometry form the literature 


Table S2: Diversity of urban scaling exponents from the literature 


VARIABLE 


Cl (95%) 


DATE 

COUNTRY 

URBAN DEFINITION 

N 

REF 


1.163 


0.774 

1980 

USA 

"cities” 

366 

m 


1.043 


0.903 

1990 

East Anglia 

"cities" 

70 

m 


0.808 


0.756 

1990 

SouthEast UK 

"cities" 

801 

m 

TotalArea 

1.014 


0.76 

2001 

Europe 

"cities" 

386 


0.946 



2001 

UK 

"cities" 

67 

f5l 


0.765 


0.637 

2001 

UK 

metropolitan local authorities 

100 

HI 


0.676 


0.309 

2005 

USA 

SMSA 

355 

III 


0.85 

[0.84;0.86] 

0.93 

2010 

USA 

Urban Areas 

3540 

m 


0.86 

[0.84;0.88] 

0.92 

2011 

USA 

Urban Areas 

441 

m 

Road Length 

0.667 


0.65 

2010 

USA 

MSA 

50 

m 


0.849 

[0.81;0.89] 

0.65 

2006 

USA 

metropolitan areas 


m 


1.211 


0.63 

1987 

USA 

MSA 

227 

ID 


1.174 


0.67 

1997 

USA 

MSA 

266 

m 

Research 

1.185 


0.69 

2002 

USA 

MSA 

278 

(31 


1.54 



2000 

USA 

SMAs 

331 

m 


1.67 

[1.54;1.80] 

0.64 

1999 

France 

Aires urbaines 

350 

m 

Health & Social 

0.95 



2000 

USA 

SMAs 

331 

m 

0.98 

[0.94;1.02] 

0.89 

1999 

France 

Aires urbaines 

350 

ii 
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